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MODIFIED BACILLUS THURINGIENSIS 
INSECTICIDAL- CRYSTAL PROTEIN GENES AND THEIR 
EXPRESSION IN ptant CTT.T.fi 

This invention provides a modified Bacillus 
thuringiensis ("Bt n ) gene (the "modified BtlCP gene") 
encoding all or an insecticidally-effective portion of 
a Bt insect icidal crystal protein ("ICP") . A plant, 
transformed with the modified Bt ICP gene can show 
higher expression levels of the encoded ICP and 
improved insect-resistance. 

Background of the Invention 

Plant genetic engineering technology has made 
significant progress during the last 10 years. It has 
become possible to introduce stably foreign genes into 
plants. This has provided exciting opportunities for 
modern agriculture. Derivatives of the Ti-plasmid of 
the plant pathogen, Agrobacterium tumef aciens , have 
proven to be efficient and highly versatile vehicles 
for the introduction of foreign genes into plants and 
plant cells. In addition, a variety of free DNA 
delivery methods, such as electroporation, 
microinjection, pollen-mediated gene transfer and 
particle gun technology, have been developed for the 
same purpose* 

The major aim of plant transformations by genetic 
engineering has been crop improvement. In an initial 
phase, research has been focused on the engineering 
into plants of useful traits such as insect-resistance. 
In — this — respect f _progres s in engineering insect 
resistance in transgenic plants has been obtained 
through the use of genes, encoding ICPs, from Bt 
strains (Vaeck et al., 1987). A Bt strain is a spore 
forming gram-positive bacterium that produces a 



parasporal crystal which is composed f crystal 
proteins which are specifically toxic against insect 
larvae. Bt ICPs possess a specific insecticidal 
spectrum and display no toxicity towards other animals 
and humans (Gasser and Fraley, 1989). Therefore, the Bt 
ICP genes are highly suited for plant engineering 
purposes. 

For more than 20 years, Bt crystal spore 
preparations have been used as biological insecticides. 
The commercial use of Bt sprays has however been 
limited by high production costs and the instability of 
crystal proteins when exposed in the field (Vaeck et 
al., 1987). The heterogeneity of Bt strains has been 
well documented. Strains active against Lepidoptera 
(Dulmage et al., 1981), Diptera (Goldberg and Margalit, 
1977) and Coleoptera (Krieg et al., 1983) have been 
described. 

Bt strains produce endogenous crystals upon 
sporulation. Upon ingestion by insect larvae, the 
crystals are solubilized in the alkaline environment of 
the insect midgut giving rise to a protoxin which is 
subsequently proteolytically converted into a toxic 
core fragment or toxin of 60-70 kDa. The toxin causes 
cytolysis of the epithelial midgut cells. The 
specificity of Bt ICPs can be determined by their 
interaction with high-affinity binding sites present on 
insects' midgut epithelia. 

The identification of Bt ICPs and the cloning and 
sequencing of Bt ICP genes has been reviewed by Hofte 
and Whiteley (198 9) . The Bt ICP genes share a number of 
common properties. They generally encode insectlcidar 
proteins of 130 kDa to 140 kDa or of about 70 kDa, 
which contain toxic fragments of 60 ± 10 kDa (Hofte and 
Whiteley, 1989). The Bt ICP genes have been classified 
into four major groups according to both their 
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structural similariti s and insecticidal spectra (Hofte 
and Whiteley, 1989): Lepidoptera-specific (Cryl) , 
Lepidoptera- and Diptera-specific (Cryll), coleoptera- 
specific (Crylll) and Diptera-specific (Cry IV) genes. 
5 The Lepidoptera-specific genes (Cryl) all encode 
130-140 kDa proteins. These proteins are generally 
synthesized as protoxins. The toxic domain is localized 
in the N-terminal half of the protoxin. Deletion 
analysis of several Cryl genes confirm that 3" portions 

10 of the protoxins are not absolutely required for toxic 
activity (Schnepf et al., 1989). Cry II genes encode 65 
kDa proteins (Widner and Whiteley, 1985). The Cry II A 
proteins are toxic against both Lepidoptera and Diptera 
while the Cry II B proteins are toxic only to 

15 Lepidopteran insects. The Coleoptera-specif ic genes 
(Cry III) generally encode proteins with a molecular 
weight of about 70 kDa. (Whiteley and Hofte, 1989). The 
corresponding gene (cry III A) expressed in E^ coli 
directs the synthesis of a 72 kDa protein which is 

20 toxic for the Colorado potato beetle. This 72 kDa 
protein is processed to a 66 kDa protein by spore- 
associated bacterial proteases which remove the first 
57 N-terminal amino acids (Mc Pherson et al., 1988). 
Deletion analysis demonstrated that this type of gene 

25 cannot be truncated at its 3' -end without the loss of 
toxic activity (Hofte and Whiteley, 1989). Recently, an 
anti-coleopteran strain, which produces a 130 kDa, 
protein has also been described (European patent 
application ("EPA") 89400428.2). The cry IV class of 

30 crystal protein genes is composed of a heterogenous 
group of Diptera-specific crystal protein genes (Hofte 
and Whiteley, 1989) . 

The feasibility of generating insect-resistant 
transgenic crops by using Bt ICPs has been 
demonstrated. (Vaeck et al., 1987 ; Fischoff et al., 



1987 and Barton et al., 1987). Transgenic plants ff r 
an attractive alternative and provide an entirely new 
approach to insect control in agriculture which is at 
the sane time safe, environmentally attractive and 
cost-effective. (Heeusen and Warren, 1989) . Successful 
insect control has been observed under field conditions 
(Delannay et al., 1989 ; Heeusen and Warren, 1989). 

In all cases, Agrobacterium -mediated gene transfer 
has been used to express chimaeric Bt ICP genes in 
plants (Vaeck et al., 1987; Barton et al., 1987; 
Fischoff et al., 1987). Bt ICP genes were placed under 
the control of a strong promoter capable of directing 
gene expression in plant cells. It is however 
remarkable that expression levels in plant cells were 
high enough only to obtain insect-killing levels of Bt 
ICP genes when truncated genes were used (Vaeck et al . , 
1987; Barton et al., 1987). None of the transgenic 
plants containing a full-length Bt ICP gene produced 
insect-killing activity. Moreover, Barton et al. (1987) 
showed that tobacco calli transformed with the entire 
Bt ICP coding sequence became necrotic and died. These 
results indicate that the Bt ICP gene presents unusual 
problems that must be overcome to obtain significant 
levels of expression in plants. Even, when using a 
truncated Bt ICP gene for plant transformation, the 
steady state levels of Bt ICP mRNA obtained in 
transgenic plants are very low relative to levels 
produced by both an adjacent NPT 1 1 -gene, used as a 
marker, and by other chimeric genes (Barton et al., 
l987-:___Vaeck et al. , 1987 ). Moreover, the Bt ICP mRNA 
cannot be detected by northern blot analysis. Similar 
observations were made by Fischoff et al, (1987); they 
reported that the level of Bt ICP mRNA was much lower 
than expected for a chimeric gene expressed from the 
CaMV35S promoter. In other words, the cytoplasmic 
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accumulation of the bt mRNA, and consequently the 
synthesis, the accumulation and thereby the expression 
of the Bt ICP protein in plant cells, are extremely 
inefficient. By contrast, in microorganisms, it has 
5 been shown that truncated Bt ICP genes are less 
favorable than full-length genes (Adang et al., 1985), 
indicating that the inefficient expression is solely 
related to the heterologous expression of Bt ICP genes 
in plants. 

10 The problem of obtaining significant Bt ICP 

expression levels in plant cells seems to be inherent 
and intrinsic to the Bt ICP genes. Furthermore, the 
relatively low and poor expression levels obtained an 
plants appears to be a common phenomenon for all Bt ICP 

15 genes. 

It is known that there are six steps at which gene 
expression can be controlled in eucaryotes (Darnell, 
1982) : 

20 1) Transcriptional control 

2) RNA processing control 

3) RNA transport control 

4) mRNA degradation control 

5) translational control 

25 6) protein activity control 

For all genes, transcriptional control is 
considered to be of paramount importance (The Molecular 

Biology of the Cell, 1989) . 

t« Eur opean patent publications ("BP") 385,962 and 



30 



359,472, efforts to modify the codon usage of Bt ICP 
genes to improve their expression in plant cells have 
been reported. However, wholesale (i.e., non-selective) 
changes in codon usage can introduce cryptic regulatory 
signals in a gene, thereby causing problems in one or 
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more of the six st ps m ntioned above f r gene 
expression, and thus inhibiting or interfering with 
transcription and/or translation of the modified 
foreign gene in plant cells. Por example, changes in 

5 codon usage can cause differential rates of mRNA 
production, producing instability in the mRNA, so 
produced (e.g., by exposure of regions of the mRNA, 
unprotected by ribosomes, to attack and degradation by 
cytoplasmic enzymes). Changes in codon usage also can 

10 inadvertantly cause inhibition or termination of RNA 
polymerase II elongation on the so-modified gene. 

Summ ar y of the Invention 

In accordance with this invention is provided a 
15 process for modifying a foreign gene, particularly a Bt 
ICP gene, whose level and/or rate of expression in 
plant cells, transformed with the gene, is limited by 
the rate and/or level of nuclear production of an mRNA 
encoded by the gene; the process comprises the step of 

20 changing adenine and thymine sequences to corresponding 
guanine and cytosine sequences encoding the same amino 
acids in a plurality of translational codons of the 
gene that would otherwise directly or indirectly cause 
a nuclear event which would negatively control (i.e., 

25 inhibit or interfere with) transcription, nuclear 
accumulation and/or nuclear export of the mRNA, 
particularly transcription, quite particularly 
elongation of transcription by RNA polymerase II of the 
plant cells. Preferably, the adenine and thymine 

30 sequences are changed to cytosine and guanine sequences 
in translational codons of at least one region of the 
gene which, during transcription, would otherwise have 
thereon a relatively low percentage of RNA polymerase 
II as compared to another adjacent upstream (i.e., 5 1 ) 
region of the gene. 



Also in accordance with this invention is provid d 
the modified Bt ICP gene resulting from the process. 

Further in accordance with this invention, a 
process is provided for improving the resistance of a 
plant against insect pests by transforming the plant 
cell genome with at least one modified Bt ICP gene* 

This invention also relates to a chimaeric gene 
that can be used to transform plant cells and that 
contains the following operably-1 inked DNA fragments in 
the same transcriptional unit: 

1) the modified Bt ICP gene; 

2) a promoter suitable for directing transcription 
of the modified Bt ICP gene in the plant cells; 
and 

3) suitable transcript 3' end formation and 
polyadenylation signals for expressing the 
modified Bt ICP gene in the plant cells. 

This invention further relates to: 

- a cell of a plant, the nuclear genome of which 
has been transformed to contain, preferably stably 
integrated therein, the modified Bt ICP gene, 
particularly the chimaeric gene; 

- cell cultures consisting of the plant cell; 

- a plant which is regenerated from the 
transformed plant cell or is produced from the 
so-regenerated plant, the genome of which contains 
the modified Bt ICP gene, particularly the 

chimaeric g ene , and which shows improved 

resistance to insect pests; 

- seeds of the plant; and 

- a vector for stably transforming the nuclear 
genome of plant cells with the modified Bt ICP 
gene, particularly the chimaeric gene. 
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Detailed D scription f the Inv ntion 

As used herein, "Bt ICP" should be understood as 
an intact protein or a part thereof which has 
insecticidal activity and which can be produced in 
nature by B. thurincriensis . A Bt ICP can be a pro toxin, 
as well as an active toxin or other insecticidal 
truncated part of a protoxin which need not be 
crystalline and which need not be a naturally occurring 
protein. An example of a Bt ICP is a Bt2 insecticidal 
crystal protein (Hofte et al., 1986), as well as its 
insecticidally effective parts which are truncated at 
its C- and/or N-terminal ends towards its tryspsin 
cleavage site(s) and preferably having a molecular 
weight of 60-80 kDa. Other examples of Bt ICPs are: 
Bt2, Bt3, Bt4, Btl3, Btl4, Btl5, Btl8, Bt21, Bt22, 
Bt73, Bt208, Bt245, BtI260 and BtI109P as disclosed in 
PCT publications WO90/15139 and W090/09445, in Hofte 
and Whiteley (1989) and in EPA 90403724.9. 

As used herein, "protoxin" should be understood as 
the primary translation product of a full-length gene 
encoding a Bt ICP. 

As used herein, "toxin" or "active toxin" or 
"toxic core" should all be understood as a part of a 
protoxin which can be obtained by protease (e.g., by 
trypsin) cleavage and has insecticidal activity. 

As used herein, "truncated Bt gene" should be 
understood as a fragment of a full-length Bt gene which 
still encodes at least the toxic part of the Bt ICP, 
preferentially the toxin. 

As use d herein, "modified Bt ICP gene" should be 

understood as a DNA sequence which encodes a Bt ICP, 
and in which the content of adenine ("A") and thymine 
("T") has been changed to guanine ("G") and cytosine 
("C") in codons, preferably at least 3, in at least one 
region of the DNA sequence without affecting the 



original amino acid sequenc f th Bt ICP. Preferably 
in at least two regions, especially in at least three 
regions, of the DNA sequence, the A and T content is 
changed to G and C in at least 3 codons. For regions 
downstream of the translation initiation site of the 
DNA sequence, it is preferred that the A-T content of 
at least about 10 codons, particularly at least about 
33 codons, be changed to G-C. 

By "region" of a modified Bt ICP gene is meant any 
sequence encoding at least three translatlonal codons 
which affect expression of the gene in plants. 

In accordance with this invention, it has been 
shown by means of mRNA turn-over studies that the 
expression pathway of a Bt ICP gene, such as bt2, btl4, 
bt!5 and bt!8, is specifically inhibited at the nuclear 
level in plant cells. In a further analysis, nuclei of 
transgenic tobacco plants, i.e., N28 - 220 (Vaeck et 
al. f 1987), were used in a nuclear run-on assay to 
determine the distribution and the relative efficiency 
of RNA polymerase II complexes to initiate 
transcription of chimaeric Bt ICP plant genes. In this 
regard, the run-on assay has been used to determine 
initially the relative efficiency of RNA polymerase II 
complexes to initiate transcription of Bt ICP genes and 
thereafter to determine the relative distribution and 
migration efficiency of the RNA polymerase II complexes 
on the Bt ICP genes. 

N28 - 220 contains the bt884 fragment under 
control of the TR 2' promoter as a chimaeric gene. 
— Bt884— is-a— 5'— fragment^of— the-bt2-gene_(H6f te_et_al . , 
1986) up to codon 610 (Vaeck et al., 1987). Using 
nuclear run-on analysis, isolated nuclei of N28 - 220 
were incubated with highly labeled radioactive RNA 
precursors, so that the RNA transcripts being 
synthesized at the time became radioactively labeled. 



10 



The RNA polymerase II molecul s caught in th act f 
transcription in the cell continue elongating the same 
RNA molecules in vitro . 

The nuclear run-on assays of nuclei of N28 - 220 
culture (non-induced cells and induced cells, TRl'-neo, 
TR2'-bt884) revealed that transcription from the TR1 1 
and TR2 f promoters is about equally efficient. This 
implies that the low Bt ICP (i.e., Bt884) expression 
levels are not due to a specifically reduced 
transcriptional activity of the TR2 1 promoter. However, 
nuclear run-on analysis with N28 - 220 nuclei indicated 
that transcription elongation of the nascent Bt ICP 
mRNA is impaired somewhere between 700 to 1000 
nucleotides downstream of the start of transcription. 
This means that RNA polymerase II is not able to 
transcribe the Bt ICP coding sequence with 100 % 
efficiency. Filter binding assays using labeled Bt DNA 
fragments spanning this region and protein extract 
prepared from tobacco nuclei reveal that this DNA 
region undergoes specific interactions with proteins 
present in nuclei. These interactions are the prime 
candidates that cause or affect the impaired elongation 
of transcription by RNA polymerase II through this 
region. By modification of this region to abolish 
specific protein binding, Bt ICP expression levels will 
increase. However, other mechanisms responsible for 
impaired elongation in this region cannot be excluded. 

Further in accordance with this invention, 
sequences within the coding region involved in negative 
control of cytop lasmic Bt ICP mRNA levels have been 
identified by deletion analysis. To this end, 24 
deletion derivatives of pVE36 have been constructed. 
Three main types of deletion mutants have been 
constructed (see fig. 3): 



WO 91/16432 



PCT/EP91/00733 



11 



-5* nd deletions 

- 3 1 end deletions 

- internal deletions. 

5 The expression of a mutant hybrid bt2-neo gene 

(encoding a fusion protein of Bt2 (Hofte et al., 1986) 
and NPTII) has been studied by means of transient 
expression experiments using the cat gene as a 
reference. To this end, the neo mRNA levels were 

10 measured in relation to cat mRNA levels in RNA extracts 
of SRI protoplasts. The ratio between the neo and cat 
mRNA level was used to quantify on a relative basis the 
nptll transcript (i.e., mRNA) levels produced by the 
different constructions. These experiments show that 

15 progressive deletions of the carboxy-terminal (i. e., 
3 f ) part or the amino-terminal (i. e., 5 V ) part of the 
Bt ICP coding sequence result in a gradual increase of 
the not 1 1 transcript level. Furthermore, since the 
changes in transcript levels are not very abrupt, these 

20 results suggest that the low transcript levels produced 
by Bt ICP genes are not controlled by a single factor. 
Nevertheless, individual modifications of bt2 coding 
sequence can significantly reduce the interference 
and/or inhibition of the expression of the mRNA encoded 

25 by Bt ICP genes in plant cells at the level of 
transcript elongation, nuclear accumulation and nuclear 
export. The modification (s) may also affect cytoplasmic 
regulation and metabolism of such mRNAs and their 
translation. 

30 Deletion analysis clearly indicates that several 

~~ Interiial~~seqi^ 
region, might be involved in the negative regulation of 
the Bt ICP expression. By way of example, a 326 bp 
region (fig. 6b) was identified in the bt2 gene that is 
involved in the negative control of BT ICP expression 
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and that is 1 cated b tw n nucleotide position 674 and 
nucleotide position 1000, particularly a 268 bp region 
between nucleotide positions 733 and 1000, quite 
particularly a 29 bp region between nucleotide 
positions 765 and 794 which carries two perfect CCAAT 
boxes which are known to be able to cause a reduction 
in elongation efficiency and termination of 
transcription by RNA polymerase II in animal systems 
(Connelly and Man ley, 1989). This internal gene 
fragment or inhibitory zone may itself comprise a 
plurality of inhibitory zones which reduce Bt ICP 
expression levels or which interact directly or 
indirectly with other zones to inhibit or interfere 
with expression. Codon usage of this inhibitory zone 
has been modified in a second step by substituting A - 
T with G - C without affecting the amino acid sequence. 
In this regard, this internal 326 bp fragment (fig. 6b) 
has been replaced with a modified Bt ICP fragment of 
this invention containing 59 modified codons. The 
effect of such modification of this inhibitory zone on 
Bt ICP expression has been analyzed both in transient 
and stable plant transformants. The results show that 
such jnodification of codon usage causes a significant 
increase of Bt ICP expression levels and hence improved 
insect-resistance . 

In addition, N-terminal deletion mutants of the 
bt2 gene have been made by deleting the first N- 
terminal 28 amino acids (Hofte et al., 1986). It is 
known for the bt2 gene that the first 28 codons can be 
deleted without loss of toxicity (Hofte et al., 1986; 
Vaeck et al., 1987). Also, codon usage for three 
codons, 29 to 31, has been changed in accordance with 
this invention by replacing A - T with G - c without 
affecting the amino acid sequence. Furthermore, an 
optimal translation initiation (ATG) site was created 
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bas d n the consensus s guence f Joshi (1987) as 
shown in fig. 6a. Plants transformed with this modified 
Bt ICP gene show significantly higher Bt ICP expression 
levels. 

In accordance with this invention, all or part of 
a modified Bt ICP gene of the invention can be stably 
inserted in a conventional manner into the nuclear 
genome of a plant cell, and the so-transformed plant 
cell can be used to produce a transgenic plant showing 
improved expression of the Bt ICP gene. In this regard, 
a disarmed Ti-plasmid, containing the modified Bt ICP 
gene, in Agrobacterium (e.g., tumefaciens ) can be 
used to transform a plant cell using the procedures 
described, for example, in EP 116,718 and EP 270,822, 
PCT publication 84/02913, EPA 87400544.0 and Gould et 
al. (1991) (which are incorporated herein by 
reference) . Preferred Ti-plasmid vectors contain the 
foreign DNA sequence between the border sequence, or at 
least located to the left of the right border sequence, 
of the T-DNA of the Ti-plasmid. Of course, other types 
of vectors can be used to transform the plant cell, 
using procedures such as direct gene transfer (as 
described, for example, in EP 233,247), pollen mediated 
transformation (as described, for example, in EP 
270,356, PCT publication WO 85/01856, and US patent 
4,684,611), plant RNA virus-mediated transformation (as 
described, for example, in EP 67,553 and US patent 
4,407,956), liposome-mediated transformation (as 
described, for example, in US patent 4,536,475) and 
other— methods_such_as_Uie_recently-_describe<i_methods__ 
for transforming certain lines of corn (Fromm et al., 
1990; Gordon-Kamm et al., 1990). 

Preferably, the modified Bt ICP gene is inserted 
in a plant genome downstream of, and under the control 
of, a promoter which can direct the expression of the 
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g n in th plant c lis. Pr f erred pr moters include, 
but are not limited to, the strong constitutive 35S 
promoter (Odell et al., 1985) of cauliflower mosaic 
virus; 35S promoters have been obtained from different 
isolates (Hull and Howell, Virology 86, 482-493 
(1987)). Other preferred promoters include the TR1* 
promoter and the TR2 1 promoter (Velten et al., 1984). 
Alternatively, a promoter can be utilized which is not 
constitutive but rather is specific for one or more 
tissues or organs. For example, the modified Bt ICP 
gene can be selectively expressed in the green tissues 
of a plant by placing the gene under the control of a 
light-inducible promoter such as the promoter of the 
ribulose - 1,5 - phosphate - carboxylase small subunit 
gene as described in EPA 86300291.1. Another 
alternative is to use a promoter whose expression is 
inducible by temperature or chemical factors. 

It is also preferred that the modified Bt ICP gene 
be inserted upstream of suitable 3 1 transcription 
regulation signals (i.e., transcript 3' end formation 
and polyadenylation signals) such as the 3' 
untranslated end of the octopine synthase gene (Gielen 
et al., 1984) or T-DNA gene .7 (Velten and Schell, 
1985). 

The resulting transformed plant of this invention 
shows improved expression of the modified Bt ICP gene 
and hence is characterized by the production of high 
levels of Bt ICP. Such a plant can be used in a 
conventional breeding scheme to produce more 
transformed plants with the same improved insect- 
resistance characteristics or to introduce the modified 
Bt ICP gene into other varieties of the same or related 
plant species. Seeds, which are obtained from the 
transformed plants, contain the modified BtlCP gene as 
a stable genomic insert. 
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Furthermore, at 1 ast two modifi d BtlCP g nes, 
coding for two non-competitively binding anti- 
Lepidopteran or anti-Coleopteran Bt ICPs, can be cloned 
into a plant expression vector (EPA 89401499.2). Plants 
transformed with such a vector are characterized by the 
simultaneous expression of at least two modified BtlCP 
genes. The resulting transgenic plant is particularly 
useful to prevent or delay development of resistance to 
Bt ICP of insects feeding on the plant. 

The following Examples illustrate the invention 
and are not intended to limit its scope. The Figures, 
referred to in the Examples, are as follows: 

Fig.l — Comparison of the transcription initiation 
frequency of RNA polymerase II complexes in nuclei of 
N2 8-220. Hybridisation efficiencies of labeled nptll 
mRNA and Bt ICP mRNA with their complementary DNA 
counterparts present on a Southern blot were compared. 
DNA fragments were obtained from a digest of plasmid 
pGSH163. A schematic view of the region is given. The 
lengths of the fragments blotted on Hybond-N filter 
(1), the homologous genes on plasmid p6SH163 (2), and 
the densitometric values (3) are as follows: 



25 



30- 



Digest: 


1 


2 


3 


BamHI/Hindlll 


2358 


neo 


12386 




1695 


bt2 


6565 




154 


bt2 






6250 


vector 





Fig. 2a ~ Determination of the distribution of the RNA 
polymerase II complexes on the Bt ICP coding sequence 
in nuclei of N28-220. The hybridisation of labeled RNA 
prepared by nuclear run on with DNA fragments of the Bt 
ICP coding sequence was quantitated. The restriction 
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fragments and scanning values are given in th tabl 
and figure. The scanning value is proportional to "X", 
the size of the DNA fragment and the # OTP per RNA 
fragment hybridising. "X" is directly proportional to 
5 the number of RNA polymerases passing through the DNA 
fragment. »X- is proportional to the scanning value 
divided by the number of OTPs. The X values of the 
different restriction fragments are shown in the 
figure. In this regard, conversion of the different 
10 densitometric values into relative hybridisation 
efficiencies by normalising the values of the number of 
dATPs present in the DNA fragment, complementary to the 
hybridising RNA, generates the value "X". »X» is a 
relative measure of the number and the length of the 
» extension of the transcripts. -X- thus reflects the 
number of RNA polymerases transcribing a specific DNA 
sequence and their elongation rate. DNA fragments 
present on the Southern digests of plasmid DNA of plant 
vector pGSH163 each have the following lengths of 
20 fragments blotted on Hybond-N filter (1), homologous 
genes on plasmid pGSH163 (2) and densitometric values 

(3): 
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neo 

bt2(l) 
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neo 



bt2(3) 
bt2(2) 
bt2(4) 
bt2(5) 
bt2(l) 



6565 

14194 

4572 

615 

13219 



736 
1817 



Fig. 2b — Schematic view of nine bt884 DNA fragments 
that were inserted into the poly linker of M13 vectors, 
MP18 and MP19 (Yanisch-Perron et al., 1985). The Bt ICP 
coding sequence is shown from AUG to 1600 nucleotides 
downstream. The relevant restriction sites and sizes of 
the DNA fragments are indicated. The nucleotide 
numbering is relative to the AUG. The subclones were 
named pJD71, pJD72, pJD73, etc. (to pJD79) , as 
indicated. The inserts were oriented into the M13 
vector such that single standed M13 carried the 
fragments of the Bt ICP coding sequence in an anti- 
sense orientation. 



Fig. 2c — Schematic representation of three nuclear 
30 — Tnin«Qn-analyse&-with-N28-220_nuclei as described by Cox 



and Goldberg (1988) . Assays were performed for periods 
of 5, 10 and 30 minutes. The labeled nuclear RNA was 
allowed to hybridize with 5 fsq of single stranded 
pJD71-pJD79 and MP18 DNA, which were immobilised on 
nylon membranes. The membranes were autoradiographed, 
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and dens i torn trie valu s v re obtained by scanning th 
autoradiographs. The abscissa shows the nucleotide 
position relative to the AUG of the Bt (i.e., bt2) 
coding sequence. The center of each of the single 
stranded Bt DNA fragments is indicated in the graph. 
The ordinate gives the relative hybridisation signal 
for each fragment corrected for the number of dATPs in 
the fragment and adjusted to 100% for the value of 
pJD71 for each of the three incubation periods. All 
values are corrected for non-specific hybridisation to 
single stranded MP18 DNA. The relative values are a 
measure for the reactivation of bt mRNA synthesis by 
RNA polymerase II. The assay does not distinguish 
between the number of mRNA extensions and the length of 
mRNA extensions. 

Fig. 3 — Construction of deletion mutants of the 
bt860-neo gene to measure the effect on cytoplasmic Bt 
ICP mRNA levels. The parental vector pV£36 is shown. 
The following deletion mutants were generated: 
1* PJD50: pJD50 was derived from pVE36 by digesting 
with BamHI and Sphl. The 5»and 3' 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 
2. PJD51: pJDSl was derived from pVE36 by digesting 
with Spel and Sphl. The 5»and 3 1 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 
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3. PJD52 : 



4. PJD53: 



10 



15 5. PJD54: 



20 



6. PJD55: 



25 



7. PJD56: 



-30- 



pJD52 was d riv d from pV£36 by digesting 
with EcoRV and Sphl. The 5 'and 3 f 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
HC1061 cells. Trans forma nts were selected 
for amp r phenotype. 

pJD53 was derived from pV£36 by digesting 
with Xcal and Sphl, The 3' protruding ends 
were filled in with Klenow DNA polymerase 
I enzyme. The treated DNA was ligated and 
then used to transform MC1061 cells. 
Trans forma nts were selected for amp r 
phenotype. 

pJD54 was derived from pVE36 by digesting 
with Aflll and Sphl. The 5" and 3» 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 

pJD55 was derived from pVE36 by digesting 
with Clal and Sphl. The 5 f and 3 1 
protruding ends were filled in with Klenow 
DNA polymerase 1 enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 

pJD56 was derived from pVE36 by digesting 

with — Xhol — and— Sphl The — 5_? and 3 f _ 

protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 
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8. PJD57: pJD57 was d rived from pVE36 by digesting 

with Aflll and BamHI. The 5 1 and 3' 
protruding ends were filled in with Klenow 
DNA polymerase I enzyme. The treated DNA 
was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 

9. PJD58: pJD58 was derived from pV£36 by digesting 

with Xcal and BamHI. The 5 1 protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme. The treated DNA was 
ligated and then used to transform HC1061 
cells. Transformants were selected for 
amp r phenotype. 

10. PJD59: pJD59 was derived from pV£36 by digesting 

with EcoRV and BamHI. The 5' protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme. The treated DNA was 
ligated and then used to transform HC1061 
cells. Transformants were selected for 
amp r phenotype. 

11. PJD60: pJD60 was derived from pVE36 by digesting 

with Spel and BamHI. The 5 1 protruding 
ends were filled in with Klenow DNA 
polymerase I enzyme. The treated DNA was 
ligated and then used to transform HC1061 
cells. Transformants were selected for 
amp r phenotype. 

12. PJD61: PJD61 was derived from PJD50. PVE36 was 
digested with Xbal and filled in with 



Klenow polymerase I. PJD50 was linearized 
with BamHI and filled in with Klenow 
polymerase I. The 375bp Xbal fragment of 
PVE36 was ligated in the filled in BamHI 
of pJD50. The ligation mixture was used to 
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transform HC1061 cells. Trans fonnants were 
selected for amp r phenotype. 

13. PJD62: PJD62 was derived from PJD50. FVE36 was 

digested with Xcal and EcoRV. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 367bp Xcal -EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform NC1061 
cells. Transformants were selected for 
amp r phenotype. 

14. PJD63: PJD63 was derived from PJD50. PVE36 was 

digested with Xcal and EcoRV. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 474bp Xcal-EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJDSO. The ligation 
mixture was used to transform HC1061 
cells. Transformants were selected for 
amp r phenotype. 

15. PJD64: PJD64 was derived from PJDSO. FVE36 was 

digested with EcoRI and EcoRV and filled 
in with Klenow polymerase I. PJDSO was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 458bp EcoRI -EcoRV 
fragment of PVE36 was ligated in the 
filled in BamHI of pJDSO. The ligation 
mixture was used to transform HC1061 
cells. Transformants were selected for 
amp^-phenotype. _ — — _ 



16. PJD65: PJD65 was derived from PJDSO. PVE36 was 
digested with EcoRI and Xbal and filled in 
with Klenow polymerase I. PJDSO was 
linearized with BamHI and filled in with 
Klenow polymerase I. The 327bp EcoRI -Xbal 
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fragment of FVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform NC1061 
cells* Transformants were selected for 
amp r phenotype* 

17. PJD66: PJD66 was derived from PJD50. PVE36 was 

digested with Spel and Xcal and filled in 
with Klenow polymerase I. PJD50 was 
linearized with BamHI and filled in with 
Klenow polymerase I* The 102 lbp Spel -Xcal 
fragment of PVE36 was ligated in the 
filled in BamHI of pJD50. The ligation 
mixture was used to transform HC1061 
cells. Transformants were selected for 
amp r phenotype. 

18. PPS56D1: PPS56D1 was derived from PJD56 by 

digesting with EcoRV. The treated DNA was 
ligated and then used to transform MC1061 
cells. Transformants were selected for 
amp r phenotype. 

19. PPS56D2: PPS56D2 was derived from PJD56 by 

digesting with Xcal and Aflll. The 5' 
protruding ends were filled in with Klenow 
polymerase I. The treated DNA was ligated 
and then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype. 

20. PPS56D3: PPS56D3 was derived from PJD56 by 

digesting with Spel and EcoRV. The 5 1 

protruding ends were filled in with Klenow 

polymerase I. The treated DNA was ligated 
and then used to transform HC1061 cells. 
Transformants were selected for amp r 
phenotype. 
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21. PPS56D4: PPS56D4 was deriv d from PJD56 by 

digesting with Xcal and partially with 
EcoRV. The treated DNA was ligated and 
then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype. 

22. PPS56D6: PPS56D6 was derived from PJD56 by 

digesting with Spel and partially with 
EcoRV. The 5" protruding ends were filled 
in with Klenow polymerase I. The treated 
DNA was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp p phenotype. 

23. PPS56D7: PPS56D7 was derived from PJD56 by 

digesting with Spel and Xcal. The 5' 
protruding ends were filled in with Klenow 
polymerase I. The treated DNA was ligated 
and then used to transform MC1061 cells. 
Transformants were selected for amp r 
phenotype. 

24. PPS56D8: PPS56D8 was derived from PPS56D2 by 

digesting with Spel and partially with 
EcoRV. The 5 1 protruding ends were filled 
in with Klenow polymerase I. The treated 
DNA was ligated and then used to transform 
MC1061 cells. Transformants were selected 
for amp r phenotype. 

Fig. 4 — Effect of deletions in the Bt ICP coding 
sequence on cytoplasmic Bt ICP mRNA levels. The 
cytoplasmic mRNA levels specified by the invariable cat 
reference gene and the different Bt ICP deletion 
mutants described in fig. 3 are listed in the table. 
The measurements were converted into relative Bt ICP 
mRNA abundances. Bt ICP and cat mRNA quantizations were 
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done as described by Cornel issen (1989) . Total RNA was 
slot blotted and hybridised with radioactively labeled 
RNA complementary to the neo and cat coding sequences. 
Values were quant itated with the aid of calibration 
curves of cold cat and Bt ICP riboprobe transcripts. 

Fig. 5 — Relative transcript levels produced by the 
deletion derivatives of pVE36. 

Fig. 6a — Schematic presentation of the synthetic DNA 
sequences used to introduce a N-terminal deletion and a 
change of the codons 29, 30 and 31 of the bt2 coding 
sequence. The oligo nucleotides were annealed according 
to Engler et al. (1988) and cloned into the BstXI 
restriction site of plasmid pVE36, yielding pPS027. The 
7360 bp fragment of pPS027 was ligated to the the 1177 
bp Clal restriction fragment of pVE36, yielding plasmid 
pPS028. pPS028 is identical to pVE36 apart for the N- 
terminal modification. 

Fig. 6b — Schematic presentation of the synthetic DNA 
sequences used to introduce an internal modification 
into the bt2 coding sequence. The oligonucleotides were 
annealed and ligated as described by Engler et al. 
(1988) and the resulting concatemeric DNA fragment was 
cut with the restriction enzymes Xbal and EcoRI to 
release the modified 327 bp Xbal-EcoRI restriction 
fragment. This fragment was ligated into the 3530 bp 
EcoRI-Xbal fragment of pPS023 which is a pUC19 
derivative (Yanisch-Perron et al., 1985) that carries 
the 1533 bp Aflll (filled in) BamHI fragment of pVE36 
~£n the Hindlll (filled in) BamHI site of pUC19, 
resulting in plasmid p PS 02 4. Plasmid p PS 02 4 was 
linearised by digestion with restriction enzyme Xbal 
and the 375 bp Xbal restriction fragment of pPS023 was 
introduced resulting in pPS025. The 1177 bp Clal 
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fragment f pPS025 was introduc d in the 7360 bp Clal 
restriction fragment of pPS027 yielding pPS029. pPS029 
is identical to ,pVE36 but carries both the amino- 
terminal modification and the internal modification of 
the Bt ICP coding sequence. 

Fig. 6c — Nucleotide sequences 800 to 4000 of the 
plasmids pVE36 and pPS029. "x" refers to not known 
nucleotides. 

Fig. 7 Schematic presentation of the effect of the 
mutations on the AT content of the Bt ICP plant gene. 
The modified regions are indicated. 

Fig. 8a — Schematic presentation of the plasmid 
constructions used in the transient expression assay. 
The relevant genes are indicated. 

Fig. 8b — Accumulation profiles of CAT (Neumann et 
al., 1987) and the modified BtlCP (Engvall and Pesce, 
1978) in a typical transient expression assay. 

Unless otherwise stated in the Examples, all 
procedures for making and manipulating recombinant DNA 
are carried out by the standardized procedures 
described in Sambrook et al., Molecular Cloning - A 
laboratory Manual , Cold Spring Harbor Laboratory 
(1989) . 

Example 1. Determination of the Efficiency of 
Transcription Initiation 

The — relative — efficiency — of — RNA polymerase II_ 

complexes to initiate transcription at chimaeric BtlCP 
plant genes was studied, using transgenic plant N28-220 
which is described by Vaeck et al. (1987) and contains 
copies of the T-DNA of plasmid pGSH163 This T-DNA 
carries the chimaeric plant genes P TR2 bt8843 'g7 and 
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PjM.neoS'ocs. Nuclei of 25 g of induced leaves of 
N28-220 were prepared according to Cox and Goldberg 
(1988) and stored the nuclei at a temperature of -70°C. 
This method causes the nascent precursor mRNA chains 
and the RNA polymerase II complexes to halt while the 
complexes remain associated at the DNA. A batch of 
these nuclei was assayed for the ability to incorporate 
radioactively labeled UTP as a measure for the 
transcriptional viability of the nuclei (Cox and 
Goldberg (1988) . This incorporation could be 
successfully repressed by addition of ce-amanitin to a 
final concentration of 2 nq/nl. This shows that the UTP 
incorporation was due to transcript elongation by RNA 
polymerase II and that RNA synthesis on the protein 
coding genes which are occupied by RNA polymerase II 
can be reactivated under the appropriate experimental 
conditions. 

Batches of the nuclei of N28-220 were used to 
synthesize radioactively labeled RNA as described by 
Cox and Goldberg (1988) . The radioactive RNA 
synthesized is a direct representation of the 
distribution of the RNA polymerases II complexes on the 
DNA in the nuclei. As the DNA of N28-220 carries two 
genes which can be assayed, namely the chimaeric neo 
gene and the chimaeric Bt ICP gene, it is possible to 
compare the distribution of RNA polymerase II complexes 
on these two genes. To this end, the radioactive RNA 
was extracted from the nuclei according to Cox and 
Goldberg (1988) and used as a probe in a conventional 
Southern hybridisation. The Southern blot contained DNA 



fragments carrying the Bt ICP and neo coding sequences 
in a molar excess relative to the neo and Bt ICP RNA 
species present in the radioactive probe. A detailed 
description of the Southern blot is given in fig. 1. 
The hybridisation experiment resulted in hybridisation 
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signals to both the neo and Bt ICP coding sequ nc s 
(fig. 1). Dens i torn etric scanning shoved that the 
intensity of the hybridisation signal to the neo and Bt 
ICP coding regions was nearly identical. This result 
5 implies that the number of transcripts initiating from 
the TR dual promoter is about similar in both 
directions. As in plant N28-220 the cytoplasmic neo 
mRNA level is several magnitudes higher than that of Bt 
ICP; this shows that the Bt ICP coding sequence indeed 
10 negatively controls accumulation of cytoplasmic Bt ICP 
mRNA, but that this phenomenon is not due to a dominant 
negative effect on transcription initiation of the 
chimaeric Bt ICP plant gene. 

15 Example 2. Transcription Elongation 

The relative distribution of RNA polymerase II 
complexes on the Bt ICP plant genes present in 
transgenic plant N2 8-220 which is described by Vaeck et 
al. (1987) was investigated. To this end, a second 

20 experiment was carried out with batches of the nuclei 
of N28-220 described in Example 1. 

The nuclei were incubated as described by Cox and 
Goldberg (1988) to synthesize radioactively labeled 
RNA. The radioactive RNA was extracted as described 

25 previously to provide a probe for a Southern 
hybridisation. The Southern blot prepared for this 
experiment contained several fragments of the Bt ICP 
coding sequence in molar excess relative to the 
complementary RNA present in the probe. The rationale 

30 of the experiment was that if the RNA polymerase II 
complexes were equall^^di¥1^ibuted over the Bt ICP 
coding region, the hybridisation with the different Bt 
ICP DNA fragments present on the Southern blot would be 
proportional to the size and dATP content of the 
different fragments. A detailed description of the DNA 
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fragm nts present on the Southern is given in fig 2a. 
The hybridisation of the radioactive RNA extracted from 
the nuclei of N28-220 with the Southern revealed that 
the complete Bt ICP coding sequence as present in 
N28-220 is transcribed by RNA polymerase II. 

Quantification of the hybridisation signals by 
dens i tome trie scanning of the autoradiogram shoved that 
more radioactively labeled RNA was hybridising with DNA 
fragments representing Bt ICP sequences located 5 1 on 
the Bt ICP coding sequence than with Bt ICP sequences 
located 3' on the Bt ICP coding sequence. The actual 
values are given in fig 2a. This in vitro experiment 
demonstrates that in vivo the RNA polymerases are not 
evenly distributed over the Bt ICP coding sequence. 

The site(s) involved in reducing the RNA 
polymerase II elongation were then determined more 
accurately. Nine M13 derivatives were made that carry 
overlapping fragments of the Bt2 coding sequence 
spanning the region from the AUG to 1584 nucleotides 
downstream. The inserts were oriented into the vector 
such that, in single stranded M13 derivatives , the Bt 
sequences were complementary to the Bt transcript. A 
schematic view of the Ml 3 clones is given in fig. 2b. 

A molar excess of each single stranded anti-Bt DNA 
was bound to nylon filters to serve as a DNA target for 
hybridisation with labeled RNA prepared from nuclear 
run-on assays with N28-220 nuclei as described by Cox 
and Goldberg (1988). Three nuclear run-ons that 
differed only in their time period of incubation were 
carried out simultaneously. The incubation time 
determines the length of extension of the nascent mRNA 
chain. Shorter incubation periods give a more accurate 
view of the position of the RNA polymerase II complexes 
relative to the substrate DNA and their ability to 
elongate at the moment of the start of incubation. 
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Henc , th sh rter th In vitro incubation p riod, the 
more accurate the view in predicting the in vivo 
situation. 

The results are shown in fig. 2c . The data for the 
5 minute incubation show that, in vivo , at a very 
discrete inhibitory zone along the bt2 coding sequence, 
one or more factors interfere with transcript 
elongation and that such factor (s) remain present in 
such inhibitory zone during the course of the in vitro 
mRNA extension reaction. Increased incubation periods 
show that, on a subset of DNA templates, RNA synthesis 
resumed downstream of such inhibitory zone in this 
assay without significantly removing the inhibition in 
the inhibitory zone itself. In this regard, the data 
indicate that: 

1. The inhibitory zone causes the RNA polymerases to 
pause and not to terminate. 

2. This pause is only transitory for a small fraction 
of the Bt DNA templates which were used. 

3. The continued RNA polymerase elongation, 
downstream of the inhibitory zone, is done by a 
large number of polymerases on the relatively 
small fraction of the Bt DNA templates. 

It is believed, therefore, that low cytoplasmic Bt 
mRNA levels are due at least in part to inefficient 
production of precursor mRNA caused by inefficient 
elongation of a nascent transcript and/or stalling of 
RNA polymerase II complexes from transcribing at an 
Tnhibitory zone. 

The inhibitory zone was assayed for its ability to 
interact with proteins present in nuclei of tobacco 
protoplasts. A crude nuclear extract was prepared from 
tobacco SRI leaf protoplasts according to Luthe and 
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Quatrano (1980) and used f r filt r binding assay 
essentially as described by Diffley and Stillman 
(1986) . 100 ng samples of protein extract were mixed 
with different amounts of radioactively labeled 532 bp 
5 Xbal-AccI bt884 DNA fragment, ranging from 0 to 1670 
picomolar, in a final volume of 0.150 ml binding buffer 
(10 mH Tris pH 7.5, 50 mM NaCl, 1 mH DTT, 1 mH EDTA and 
5% glycerol) . After 45 minutes incubation at room 
temperature, the samples were filtered through an 

10 alkali-washed nitrocellulose membrane and washed twice 
with 0.150 ml of an ice-cold solution containing 10 mM 
Tris pH 7.5, 50 mM NaCl and 1 mM EDTA. The retention of 
DNA-protein complex was quantified by scintillation 
counting and revealed that the binding had a 

15 dissociation constant in the 100 picomolar range. The 
binding was not affected by preincubation of the 
nuclear extract with a molecular excess of a specific 
competitor DNA. 

20 Example 3. Construction of Deletion Mutants 

The previous two examples demonstrate that the Bt 
ICP coding sequence in a chimaeric plant gene 
negatively affects the cytoplasmic Bt ICP mRNA level 
directed by the chimaeric plant gene. It is shown that 
25 this negative control is not at the level of 
transcription initiation but at least in part due to a 
reduced ability of RNA polymerase II to generate 
precursor Bt ICP mRNA. A deletion analysis of the 
chimaeric Bt ICP plant gene was performed to identify 
30 whether impaired transcription elongation is the 

exclusive— mechanism — by — which — the^Bt — ICP — sequence 

interferes with gene expression. The rationale of the 
experiment is that the introduction of specific 
deletions in the Bt ICP coding region could remove or 
inactivate the sequence element (s) responsible for the 
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n gative control. As a r suit such mutant gene would 
direct an increased level of cytoplasmic mRNA. This 
method can therefore be used to map and identify the 
sequence(s) involved in the negative control. 
5 To perform this analysis, a deletion series of the 

bt860- neo gene (Vaeck et al. f 1987) was made. Pig. 3 
gives a schematic representation. The resultant 
deletion derivatives do not specify a Bt ICP and 
therefore are assayed at RNA level only. In order to 
10 obtain accurate Bt ICP mRNA concentration values, the 
deletion mutants were compared in a transient 
expression system using tobacco leaf protoplasts of SRI 
(Cornelissen and Vandewiele, 1989). The relative mRNA 
abundances were calculated using a correction factor 
15 provided by the mRNA level specified by the cat 
reference gene present on the same plasmid as the 
mutant Bt ICP gene. Four hours after introduction of 
the genes the tobacco leaf protoplasts were harvested, 
and total RNA was prepared and analysed (fig. 4). 
20 mutants nos. 50-60 (fig. 3) show that 

progressive deletions of the carboxy-terminal part or 
the amino-terminal part of the Bt ICP coding sequence 
result in a gradually increasing neo transcript level. 
As there are not very abrupt changes in transcript 
25 levels, these results suggest that the low transcript 
level produced by full length Bt ICP genes is 
controlled by a number of signals. Deletions within the 
Bt ICP coding sequence indeed did not localise a 
specific sequence element which, by itself, is 
30 respo nsible for the low Bt ICP mRNA level. Similarly , 
cloning of fragments of the Bt ICP coding sequence in 
pJD50 (fig. 3) did not allow identification of such a 
region. 

The relative transcript levels were plotted 
against the length of the Bt ICP sequence present in 
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the differ nt del tion derivatives. Pig. 5 suggests 
that hybrid Bt ICP-neo transcript levels drop with 
increasing length of the Bt ICP sequence. In this 
respect, the mutants nos. 61-66 (fig. 3) form an 
exception as they show in average a low transcript 
level relative to the length of the Bt ICP sequence. 

These results show that the low transcript levels 
of Bt ICP plant genes in tobacco are not exclusively 
^ due to an impaired elongation of the nascent transcript 
but that a number of signals operate to cause a reduced 
expression capacity of the chimaeric Bt ICP gene. 

Example 4» 

To determine whether cytoplasmic events are 

15 important in causing inefficient expression of the bt2 
gene in plants, the following test was carried out. 
Cytoplasmic bt2 mRNA steady state levels in transgenic 
leaf protoplasts of N28-220 are normally found to be 
below 1 transcript per cell. The steady state level is 

20 determined by, and is proportional to, the number of 
bt2 transcripts entering per time unit the cytoplasm 
and the cytoplasmic half-life of the transcript. When 
steady state levels are achieved, the absolute numbers 
of transcripts entering and leaving the cytoplasmic bt2 

25 mRNA pool are equal. Therefore, the cytoplasmic half- 
life and cytoplasmic steady state level of the bt2 
transcript will reveal whether its cytoplasmic steady 
state level is due to a relatively low import of bt2 
transcript, a relatively high turnover (i.e., 

30 conversion to a protein) rate, or a combination of 
both. = = 

The cytoplasmic turnover of bt884 transcripts was 
determined according to Gallie et al. (1989). a capped 
and polyadenylated synthetic bt884 mRNA was produced in 
vitro according to protocols of Promega Corporation 
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(Madison! Wisconsin, USA) and introduced into tobacco 
leaf protoplasts simultaneously with a synthetic bar 
(De Block et al., 1987) mRNA. The two synthetic 
transcripts differed only in their coding sequences. At 
various times after RNA delivery, samples were taken, 
and total RNA was isolated. Northern analyses revealed 
that the half -lives (T 1/2) of the synthetic bt884 and 
bar transcripts were about 8 + 3 hours and 5 + 2 hours, 
respectively. See Table l f below. These data show that 
the bt884 coding sequence, more particularly the bt884 
codon usage and the AU-rich motifs in the bt884 coding 
sequence, do not render the bt884 mRNA more unstable 
than the bar mRNA which is known to accumulate in the 
cytoplasm to about 1000 transcripts per tobacco leaf 
protoplast (calculated from Cornel issen, 1989) . The low 
cytoplasmic steady state level of the bt884 transcripts 
is, therefore, caused by a lack of import of 
transcripts into the cytoplasm. Thus, the expression 
defect of the bt884 gene has to be restored by 
introduction of modifications in the bt884 coding 
sequence that improve the expression pathway in the 
nucleus. 

Expression of the btl4, bt!5 and bt!8 genes in 
tobacco revealed that these genes also direct low 
cytoplasmic mRNA steady state levels. Therefore, a 
similar analysis was carried out with synthetic btl4, 
bt!5 and bt!8 transcripts to identify whether the 
expression defect had a cytoplasmic or nuclear 
character. Table 1, below, shows that all three 
transcripts^behave-as-stabler-mRNAs— in-the-cytoplasm-of- 
tobacco leaf protoplasts. Therefore, bt!4, bt!5 and 
bt!8 genes, like the bt884 gene, must be deficient in 
exporting high levels of bt transcript to the 
cytoplasm, and to improve the expression of such genes, 
it is necessary to modify their coding sequences so 
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that nucl ar events, which int rfer with fficient 
gene expression, are avoided or ameliorated. 

Table 1 

5 

Half-life determination of synthetic bt and bar mRNAs 
in Nicotiana tabacum cv. Petite Havanna SRI leaf 
protoplasts 



10 



15 



Example 


l 8t mRNA 


Tl/2 


2 nd mRNA 


Tl/2 






(Hours) 




(Hours) 


A 


bt884 


8+/-3 


bar 


5+/-2 


B 


bt!4 


7+/-2 


bar 


6+/"3 


C 


btl5 


12+/-5 


bar 


21+/-12 


D 


bt!8 


10+/-5 


bar 


12+/-5 



Legend 



The synthetic bar transcripts had a length of 783 bases 
and included a cap, the TMV leader (77 bases, Danthinne 
and Van Emmelo, 1990) , the bar coding sequence (552 
bases; De Block et al., 1987), a trailer of 52 
nucleotides consisting of the bases GAUCA CGCGA AUU and 
39 bases from the pGEM-3Z (Promega) polylinker (Kpnl 
(T4 DNA pol.)-HindIII (T4 DNA pol.), and a poly (A) of 
the composition (AJ^A^GfAJjj, followed by the 
nucleotides GCU. 

The synthetic bt884 transcripts had a length of 2066 
bases and included a cap, the TMV leader (77 bases) , 
-the- bt88_4^ _CQdinq^seguence followed by the trailer nnr_n_ 
the Klenow treated PstI site (1843 nucleotides), the 
trailer continued with AAUUC CGGGG AUCAA UU, 39 bases 
of the pGEM-3Z polylinker and the (A) 33 G(A) 32 G(A) 21 
poly (A) , followed by the nucleotides CG. 
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The synth tic bt!4 transcripts had a length of 2289 
bases and included a cap, the TMV leader (77 bases) , 
the bt!4 coding sequence till the Klenov treated Bell 
site (2023 bases) , plus 26 supplementary nucleotides CG 
UCG ACC U6C AGC CAA GCU UGC UGA, a trailer starting 
with UU6AU UGACC GGAUC CGGCU CUAGA AUU, followed by 39 
bases of the pGEM~3Z poly linker, and the 
(A) 33 G(A) 32 G(A) 21 poly (A), followed by the nucleotides 
CGGUA CCC. 

The synthetic bt!5 transcripts had a length of 2198 
bases and included a cap, the TMV leader (77 bases) the 
btlS coding sequence as in pVE35 (PCT publication 
W090/15139) followed by the trailer till the Klenow 
treated BamHI site (1989 bases) , the trailer then 
continued with AAUU, 39 bases of the pGEM-32 polyl inker 
and the (A) s G(A)3gG(A) 2 i poly (A) , followed by the 
nucleotides CG. 

The synthetic bt!8 transcripts had a length of 2184 
bases and included a cap, the TMV leader (77 bases) the 
bt!8 coding sequence until the Klenow treated BcLI site 
(1918 bases) , followed by 26 nucleotides until the 
translation stop CG UCG ACC UGC AGC CAA GCU UGC UGA, a 
trailer starting with UUGAU UGACC GGAUC GAUCC GGCUC 
AGAUC AAUU, 39 bases of the pGEM-3Z polyl inker and the 
(A) 33 G(A) 32 G(A) 21 poly (A), followed by the nucleotides 
CG. 

Example 5. Construction of Modified Bt ICP Genes 

Examples 1- 4 show that the expression in a plant 
of a Bt ICP gene is negatively affected by the Bt ICP 
coding sequence at both transcriptional and post* 
transcriptional levels, but principally by nuclear 
events. These examples also show that the control of 
expression is not confined to a specific DNA sequence 
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within the Bt ICP coding sequence. Inst ad, the 
negative effect on gene expression is an intrinsic 
property of the Bt ICP coding sequence. On this basis, 
it is believed that, by directed change of the DNA 
sequence of the Bt ICP coding region, an improvement of 
gene expression will occur. The improvement will be of 
a cumulative type as the negative influence of the Bt 
ICP coding region is spread over the complete coding 
sequence. Similarly, an improvement of gene expression 
will be obtained by reduction of the length of the Bt 
ICP coding sequence. This improvement will have a 
cumulative effect if used in combination with 
modifications of the Bt ICP coding region. 

Therefore, two types of modifications were 
introduced into a Bt ICP (i.e., btt) coding sequence 
which, as will be shown, indeed resulted in a 
significant increase in Bt ICP plant gene expression. 
First, the DNA sequence was modified in the central 
region of the toxic core fragment of the Bt ICP as 
transcription elongation is impaired in this region. 
Secondly, the length of the Bt ICP coding sequence was 
reduced as the negative influence is proportional to 
the length of the Bt ICP coding sequence, a detailed 
description of the mutations is given in figs. 6a, b 
and c. As shown in fig. 7, the modifications change the 
AT-content of the chimaeric Bt ICP gene significantly. 
The modifications change the primary DNA structure of 
the Bt ICP coding sequence without affecting the amino 
acid sequence of the encoded protein. It is evident 
that, if more DNA mutations were to be introduced into 



the Bt ICP coding sequence, a further improvement of 
gene expression would be obtained. 

To determine the effect of the modifications, the 
expression properties of the modified BtlCP gene and 
the parental bt860 - neo gene were compared in a 
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transi nt expr ssion system as d scribed by c rnelissen 
and Vandewiele (1989) and Denecke et al. (1989). 
Basically, the accumulation profiles of the genes under 
study were compared by relating their profiles to the 
5 profile of a reference gene present in the same 
experiment. Fig. 8a shows the vectors used in the 
assay, and fig. 8b shows that the accumulation of the 
reference CAT protein is nearly identical in both 
experiments. It is not possible to measure the 

10 accumulation of Bt ICP encoded by the parental 
bt860-neo gene, but the modified Bt ICP gene clearly 
directs an increased synthesis of Bt ICP. 

These results demonstrate that mutation of the Bt 
ICP coding sequence relieves the negative influence of 

15 the Bt ICP coding sequence on the expression of a Bt 
ICP plant gene. 

Example 6. Cloning and Expression of Modified BT ICP 
Genes in Tobacco and Potato Plants 

20 Using the procedures described in US patent 

application 821,582, filed January 22, 1986, and EPA 
86300291.1, EPA 88402115.5 and EPA 89400428.2, the 
modified Bt ICP (i.e., bt2) genes of figs. 6 and 7 are 
inserted into the intermediate T-DNA vector, pGSH1160 

25 (Deblaere et al., 1988) between the vector's T-DNA 
terminal border repeat sequences. 

To obtain significant expression in plants, the 
modified Bt ICP genes are placed under the control of 
the strong TR2* promoter (Velten et al., 1984) and are 

30 fused to the transcript 3' end formation and 
polyadenylation signals of ~12^T-DNA^ene"7 (Velten^and - 
Schell, 1985). 

In addition, the translation initiation context or 
site are changed in accordance with the Joshi consensus 
sequence (Joshi, 1987) in order to optimize the 
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translation initiati n in plant cells. To this end, an 
oligo duplex (figs. 6a and 6b) is introduced to create 
the following sequence at translation initiation site: 
AAAACCATGGCT . In this way, an additional codon (i.e., 
GCT) coding for alanine is introduced. Additionally, 
Kpnl and BstXI sites are created upstream of the ATG 
translation initiation codon. 

Using standard procedures (Deblaere et al., 1985), 
the intermediate plant expression vectors, containing 
the modified BtlCP gene, are transferred into the 
Aqrobacterium strain C58C1 Rif* (US patent application 
821,582; EPA 86300291.1) carrying the disarmed Ti- 
plasmid p6V2260 (Vaeck et al., 1987). Selection for 
spectinomycin resistance yields cointegrated plasmids, 
consisting of pGV2260 and the respective intermediate 
plant expression vectors. Each of these recombinant 
Aqrobacterium strains is then used to transform 
different tobacco plant cells ( Nicotian la tabacum ) and 
potato plant cells ( Solanum tuberosum ) so that the 
modified Bt ICP genes are contained in, and expressed 
by, different tobacco and potato plant cells. 

The transgenic tobacco plants containing the 
modified Bt ICP genes are analyzed with an ELISA assay. 
These plants are characterized by a significant 
increase in levels of Bt (Bt2) proteins, compared to a 
transgenic tobacco plant containing a non-modified Bt 
ICP (bt2) gene. 

The insecticidal activity of the expression 
products of the modified Bt ICP (bt2) genes in leaves 
of transformed tobacco and potato plants is evaluated 



by recording the growth rate and mortal ity~of larvae of" 
Tobacco hornworm (Manduca sexta ) , Tobacco budworm 
(Heliotis virescens) and potato tubermoth ( Phthorimaea 
operculella ) fed on leaves of these two types of 
plants. These results are compared with the growth rate 
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of larva fed 1 aves from tobacc and potat plants 
transformed with the unmodified or parental Bt ICP 
(bt2) gene and from ion trans formed potato and tobacco 
plants. Toxicity assays are performed as described in 
EPA 88402115.5 and EPA 86300291.1. 

A significantly higher mortality rate is obtained 
among larvae fed on leaves of transformed plants 
containing and expressing the modified Bt ICP genes. 
Tobacco and potato plants containing the modified Bt 
ICP genes show considerably higher expression levels of 
Bt ICPs compared to tobacco and potato plants 
containing the unmodified Bt ICP gene. 

The insecticidal activity of three transgenic 
tobacco plants containing the modified Bt ICP genes is 
determined against second and third instar larvae of 
Heliothis virescens . The control plant was not 
transformed. The results are summarized in Table 2, 
below. 

Table 2 



Plant % mortality of insects (recorded after 

5 days) 

Control 11 
No. 1 100 
No. 2 88.5 
No. 3 100 



Needless to say, this invention is not limited to 
tobacco_and_ppt_a_to _ plants_transf ormed with_the_modif ied 
Bt ICP gene. It includes any plant, such as tomato, 
alfalfa, sunflowers, corn, cotton, soybean, sugar 
beets, rapeseed, brassicas and other vegetables, 
transformed with the modified Bt ICP gene. 
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Nor is th invention limited to the us of 
Agrobacterium tumefaciens Ti-plasmids for transforming 
plant cells with a modified Bt ICP gene, other known 
techniques for plant transformation, such as by means 
of liposomes, by electroporation or by vector systems 
based on plant viruses or pollen, can be used for 
transforming monocotyledonons and dicotyledons with 
such a modified Bt ICP gene. 

Nor is the invention limited to the bt2 gene, but 
rather encompasses all Cry I, Cry II, Crylll and Cry IV 
Bt ICP genes. 
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Claims 

1. A process for modifying a Bt ICP gene to improve 
its expression in a plant cell, transformed with the 

5 gene; the process comprising the step of: changing A 
and T sequences in a plurality of translational codons 
of the gene to corresponding G and C sequences encoding 
the same amino acids, so as to improve the gene's 
transcription to an mRNA, the nuclear accumulation of 
10 the mRNA and/or the nuclear export of the mRNA, 
particularly the gene's transcription, in the plant 
cell. 

2. The process of claim 1 for modifying a Bt ICP gene 
15 to improve its transcription in plant cells, 

transformed with the modified gene, wherein the 
plurality of translational codons is at least one 
region of the gene which, during transcription, has 
thereon a relatively low percentage of RNA polymerase 
20 II of the plant cell as compared to another adjacent 
upstream region of the gene. 

3. The process of claim 1 or 2, wherein the Bt ICP 
gene encodes a Bt insecticidal crystal protein 
truncated towards a trypsin cleavage site, preferably 

25 at both its C-terminal and N-terminal ends, and 
preferably encoding a portion of the protein of about 
60 - 80 kDa, particularly the toxin of the protein. 

4. The process of anyone of claims 1-3, wherein A and 
30 T sequences of at least 3 codons are changed to G and C 

sequences at a translation initiation site of - the gene 
and A and T sequences of at least about 3, preferably 
at least about 10, especially at least about 33 , codons 
are changed to G and C sequences in a second region of 
the gene, preferably affecting transcription 
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elongati n, downstream of th translati n initiation 
site* 

5. The process of anyone of claims 1-4, wherein A and 
5 T sequences of at least about 3, preferably at least 

about 10, especially at least about 33, codons are 
changed to 6 and C sequences in a third region of the 
gene, preferably affecting cytoplasmic UNA 
concentration . 

10 

6. The process of claim 4 or 5, wherein A and T 
sequences of at least about 3 codons are changed to G 
and C sequences at a translation termination end of the 
gene* 

15 7. The process of anyone of claims 4-6, wherein the 
gene is a cry I gene, such as a bt2, btl4, bt!5 or bt!8 
gene, preferably a bt2 gene, or a gene having 
substantial sequence homology thereto. 

20 8. The process of claim 7 wherein the gene is a bt2 
gene; the second region being between about nucleotides 
674 and 1000 and A and T sequences of about 59 or more 
codons are changed to G and C sequences in the second 
region, particularly between about nucleotides 733 and 

25 1000, quite particularly between about nucleotides 765 
and 794. 

9. The process of anyone of claims 1-8, wherein the 
gene is further modified by substituting for its ATG 

3Q translation initiation site : AAAACCATGGCT. 

10. The modified Bt TCP gene obtained by the process 
of anyone of claims 1-9. 
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11. A chimaeric gene for transforming a cell of a 
plant, comprising the following operably-linked DNA 
fragments in the same transcriptional unit: 

5 a) the modified Bt TCP gene of claim 10; 

b) a promoter capable of directing expression of 
the modified Bt ICP gene in the plant cell; and 

c) transcript 3* end formation and polyadenylation 
10 signals suitable for expressing the modified Bt 

ICP gene in the plant cell. 

12. The plant cell of claim 11, transformed with the 
chimaeric gene of claim 11. 

15 

13. A plant, plant tissue or plant cell culture 
consisting of the plant cells of claim 12. 

14. A seed of the plant of claim 13. 

20 15. A vector, preferably a Ti-plasmid, for stably 
transforming the nuclear genome of a plant, comprising 
the chimaeric gene of claim 11. 

16. A process for protecting the plant of claim 10 
25 against an insect pest, comprising the step of: 

transforming the genome of the plant with the chimaeric 
gene of claim 11. 

17. A process for modifying a foreign gene whose rate 
and/or level of expression in a plant cell, transformed 

30 with the gene, is substantially limited by the rate 
and/or level of nuclear production of an mRNA encoded 
by the gene; the process comprising the step of: 
changing A and T sequences in a plurality of 
translational codons of the gene, particularly in a 
plurality of translational codons in at least one 
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region of the gen which, during transcription, has 
thereon a relatively low percentage of RNA polymerase 
II of the plant cell as compared to another adjacent 
upstream region of the gene; the A and T sequences 
5 being changed to corresponding G and C sequences 
encoding the same amino acids, so as to improve the 
gene's transcription to the mRNA, the nuclear 
accumulation of the mRNA and/or the nuclear export of 
the mRNA, particularly the gene's transcription to the 
10 mRNA, quite particularly the transcript elongation by 
RNA polymerase II on the gene, in the plant cell. 
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Figure 2c 
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RNA polymerase II migration over 
the bt2 coding sequence 
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riGURE 3 



Spliced PVE36.SEQ 1:8600 into PVE36.SEQ 
8600 Base Pairs 



PCT/EP91/00733 




,30,20 



1 : Aatll 

2 : AccI 

3 : Aflll 

4 : AlvOT 

5 : Apal 

6 : Avrll 

7 : BasHI 



8 : 

9 : 
10 
11 
12 
13 



Bell 
BspMTI 
: BssHII 
; BstELI 
; BstXI 
EagI 



14 : EcoNI 



15 
16 
17 
18 
19 
20 
21 



EcoRV 

Bco31I 

Espl 

Hindll 

Hindlll 

Kpnl 

Nael 



22 
23 
24 
25 
26 
27 
28 



ttarl 

Ndel 

Nhel 

Nsil 

PflttI 

PpuMI 

Pvul 



29 : KsrII 

30 : SacI 

31 : Spel 

32 : Sphl 

33 : Tth111I 

34 : Xcal 

35 : Xhol 
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OLIGOPS15 

GTACCAAAACCATGGCTATCGAGACCGGTTACACCCCAATCGATATCG 



OLIGOPS16 

ATCGATTGGGGTGTAACCGGTCTCGATAGCCATGGTTTTGGTACCGAT 
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3) OLIGOPS16', 4) OLIGOPS15, 



Name Base _ ++*nn%vrrc TATCGAGACC GGTTACACCC CAATCGAT 

3 I MCG S tSSaGACC GGTTACACCC CAATCGATAT CG 

C 0N X ATCGGTACCA AAACCATGGC TATCGAGACC GGTTACACCC CAATCGATAT CG 
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rcl«cc»ACCCAOTCTTAa«OA™^^ 

OlIQOli 

AGGGAAGCATCAGGAGCC 



CACACTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCT 



CACAGGGGAGAGTACTACTGGTCCGGGCACCAGATCATGGCTTCCCCTGTGGGGTTCTCGGGGCCAGAATTCG 



OLIG07 



TCTCCCCTG 
OLIG08 



TOACCO.OCOTOTACATOaTOATOCTOtTA.OCATCTOCATCAAOTOTOaCCTCCI 



GATGCTTCCCT 



oaATOCOCTCAOCCOACCCTOOCAAOCTACCGTCOAAOtTCTCTAACAOTGCO 
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synthe-9 Linear LENGTH - 343 

149 213 278 

77 148 212 277 

5 66 137 202 267 343 

1 65 78 136 201 266 339 

<— OLIGO10' «— -OLIG09' — <~ OLIG08 9 ««0LIGO7' <«OLIG06' < 

> OLIGOl > 0LIG02 > OLIG03 > OLIG04 > OLIG05 > 



Kame Base 

1 1 GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

10 1 CTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

CON 1 GATCCTCTAG AGACTGGATC AGGTACAACC AGTTCAGGAG GGAGTTAACC CTAACCGTGT 

1 61 TAG AC 

10 57 TAGACATCGT GTCCCTA 

2 1 ATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 
9 1 TTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 

CON 61 TAGACATCGT GTCCCTATTC CCGAACTACG ACAGCAGGAC GTACCCAATC CGAACCGTGT 

2 56 CCCAGTTAAC CAGGGA 

9 44 CCCAGTTAAC CAGGGAGATC TACACCAA 

3 1 GATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 
8 1 CC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 

CON 121 CCCAGTTAAC CAGGGAGATC TACACCAACC CAGTGTTAGA GAACTTCGAC GGTAGCTTCC 

3 45 GAGGCTCGGC TCAGGGCATC G 

8 33 GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TC 

4 1 AGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA 
7 1 AGGAGCCC ACACTTGATG GACATCCTTA 

CON 181 GAGGCTCGGC TCAGGGCATC GAGGGAAGCA TCAGGAGCCC ACACTTGATG GACATCCTTA 

4 40 ACAGCATCAC CATCTACACG GACGCT 

7 29 ACAGCATCAC CATCTACACG GACGCTCACA GGGGAGA 

5 1 CACA GGGGAGAGTA CTACTGGTCC GGGCACCAGA 

6 1 GTA CTACTGGTCC GGGCACCAGA 

CON 241 "ACAGCATCAC^CATCTACACG-GAeGGTCACA-GGGGAGAGTA-CTACT-GGTCC-GGGCACCAGA 



5 35 TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCG 

6 24 TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC 

CON 301 TCATGGCTTC CCCTGTGGGG TTCTCGGGGC CAGAATTCGG ATC 
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21 54 
GAT CCT CTA GAG ACT GGA TCA GGT ACA ACC AGT TCA GGA GGG AGT TAA CCC TAA 
S r Ser Arg Asp Trp lie Arg Tyr Asn Gin Phe Arg Arg Glu L u Thr Leu Thr 

81 108 
CCG TGT TAG ACA TCG TGT CCC TAT TCC CGA ACT ACG ACA GCA GGA CGT ACC CAA 
Val Leu Asp lie Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro lie 

135 162 
TCC GAA CCG TGT CCC AGT TAA CCA GGG AGA TCT ACA CCA ACC CAG TGT TAG AGA 
Arg Thr Val Ser Gin Leu Thr Arg Glu lie Tyr Thr Asn Pro Val Leu Glu Asn 

189 216 
ACT TCG ACG GTA GCT TCC GAG GCT CGG CTC AGG GCA TCG AGG GAA GCA TCA GGA 
Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu Gly Ser lie Arg Ser 

243 270 
GCC CAC ACT TGA TGG ACA TCC TTA ACA GCA TCA CCA TCT ACA CGG ACG CTC ACA 
Pro His Leu MET Asp He Leu Asn Ser He Thr He Tyr Thr Asp Ala His* Arg 

297 324 
GGG GAG AGT ACT ACT GGT CCG GGC ACC AGA TCA TGG CTT CCC CTG TGG GGT TCT 
Gly Glu Tyr Tyr Trp Ser Gly His Gin He MET Ala Ser Pro Val Gly Phe Ser 



CGG GGC CAG AAT TCG GAT C 
Gly Pro Glu Phe Gly 
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Figure 6c 



PVE36.SEQ nucleotide 800 - 4000 



AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTAGCATCTGAAT 

TTCATAACCAATCTCGATACACCAAATCGW^P^^^^^^^^^^^^^^^^^^^^^^^^^ 

TTATAATTG1TPAAGTAACCCTGAAGTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCA 

ATCGATATTTCCTTGTCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAG 

GACTAGTTGATATAATATGGGGAATTTTTGGTCCCT CTCA ATGGGACG C ATTT CTTGT ACAA ATTG AACA 

GTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAGGACTAAGCAAT 

CTTTATCAAATTTACGCAGAATCTTTTAGAGACTGGGAAG CAGAT CCT ACT AAT CC AG CATT A AG AGA AG 

AGATGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAA 

TTATCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTT 

TCAGTGTTTGGACAAAGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGC 

TTATTGGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGACCGGA 

TTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTATTAGATATCGTTTCT 

CTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTTTCCCAATTAACAAGAGAAATTTATA 

CAAACCCAGTATTAGAAAATTTTGATGGTAGTTTTCGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAG 

GAGTCCACATTTGATGGATATACTTAACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTAT 

TGGTCAGGGCATCyUlATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATG 

GAACTATGGGAAATGCAGCTCC^CAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGAACATT 

ATCGTCCACTTTATATAGAAGACCTTTTAATATAGG GAT AA ATA ATCAA CAACT ATCTGTT CTT G A CGGG 

ACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTATACAGAAAAAGCGGAACGGTAGATT 

CGCTGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGATTTAGTCATCGATTAAGCCA 

TGTTTCAATGTTTCGTTCAGGCTTTAGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGG 

ATACATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCACAAATTACACAAATACCTTTAACAAAAT 

CTACTAATCTTGGCTCTGGAACTTCTGTCGTrAAAGGACCAGGATTTACAGGAGGAGATATTCTTCGAAG 

AACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATT A CTG C ACC ATT AT CA C A AAG AT ATCGGGT A 

AGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGG 

GG AATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTAC 

TACTCCGTTTAACTTTTCAAATGGATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAAT 

GAAGTTrATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAA 

GAGCACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGA 

TTATCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAA 

GAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAAXXXXXCCTCGAGCTTGGATG 

GATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAAT 

CGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGAC 

CTGTCCGGTGCCCTGAATGAACTGGAGGACGAGGCAGCGCGGCTATCGTGG CTGGCCACG A CGG G CGTT C 

CTTGCGCAGCTGTGCPCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGG 

GCAGGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG 

CTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTA 

CTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGA 

ACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGC 

TTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGG ATT CATCG A CT GTGGCCGG CTG G GT GTG GCGG 

ACCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCG 

CTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAG 

TTCTTCTGACAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGC 

ACTCAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAATTTAAAC 

TTGCATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAATAAATATTTA 

AACTATATTTCTTTCAAGATGGGAATTAACATCTACAAATTGCCTTTTCTT 

-T-he-A!EG— initiation codon and the TGA stop codon are underlined. 
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Figure 6c (oant.) 



PPS029.SEQ nucleotide 800 - 4000 



AAATGGATAAATAGCCTTGCTTCCTATTATATCTTCCCAAATTACCAATACATTACACTAGCATCTGAAT 

TTCATAACCAATCTCGATACACCAAATCGGTACCAAAACCATGGCTATCGAGACCGGTTA^ 

GATATTTCXOTGTCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGAC 

TAGTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATTGAACAGTT 

AATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTAGAAGGACTAAGCAATCTT 

TATCyUUOTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGATCCTACTAATCCAGCATTAAGAGAAGAGA 

TGCGTATTCAATTCAATGACATGAACAGTGCCCTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTA 

TCAAGTTCCTCTTTTATCAGTATATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCA 

GTGTTTGGACAAAGGTGGGGATTTGATGCCGCG ACT AT CA AT A GTCGTTAT A ATG ATTT AA CT A GG CTT A 

TTGGCAACTATAUGATCATGCICTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGACCGGATTC 

TAG AGACTGGATCAGGTACAACCAGTTCAGGAGGGAGTTAACCCTA A CCGTGTT AG ACATCGTGT CC CT A 

TTCCCGAACTAW5ACAGCAGGACGTACCCAATCCGAACCGTGTCCCAGTTAACCAGGGAGATCTACACCA 

ACCCAGTGTTAGAGAACTTCGACGGTAGCTTCCGAGGCTCGGCTCAGGGCATCGAGGGAAGCATCAGGAG 

CCCA(^CTTGATGGACATCCTTAACAGCATCACCATCTACACGGACGCTCACAGGGGAGAGTACTACTGG 

TCCGGGC^CC^GATCATGGCTTCCCCTGTGGGGTTCT(^GGGCCAGAATTCACTTTTCCGCTATATGGM 

CTATGGGAAATGCAGCTCC^CAACAACXSrATTGTTGCTC^ 

GTCCACTTTATATAGAAGACCTTTTAATATAGGGATA A AT AAT CA ACA ACTATCTGTT CTTG ACGGGA CA 

GAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTATACAGAAAAAGCGGAACGGTAGATTCGC 

TGGATGAAATACCGCCACAGAATAACAACGTGCCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGT 

TTCyUVTGITrCGTTC^GGCTTTAGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCT 

CATCGTAGTGCTGAATTTAATAATATAATTCCTTCATCACAAATTACACAAATACCTTTAACAAAATCTA 

CTAATCTOGGCTCTGGAACTTCTGTCGTTAAAGGACCAGG ATTT ACAGGAG GAGAT ATTCTT CGA AG A A C 

TTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTATCACAAAGATATCGGGTAAGA 

ATTCGCTACGCTTCTACCACAAATTTACAATTCCATACATCAATTGACGGAAGACCTATTAATCAGGGGA 

ATTTTTCAGCAACTATGAGTAGTGGGAGTAATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTAC 

TCCGTTTAACTTTTCAAATGGATCAAGTGTATTT A CGTT A AGTG CTCATGT CTTC AATT CA GG CA ATG A A 

GTTTATATAGATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 

CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATGTGACGGATTA 

TCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTTTGTCTGGATGAAAAAAAAGAA 

TTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATGAGCGGAAXXXXXCCTCGAGCTTGGATGGAT 

TGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGG 

CTGCTCTG ATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCG C CCGGTTCTTTTTGT CA AG ACCG A C CTG 

TCCGGTGCCCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTT 

GCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCA 

GGATCTCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATC CATCATGGCTGATGCA ATG CG GCGG CTG 

CATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTC 

GGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATC AGGGGCT CG CG CC AG CCG A ACT 

GTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTG 

CCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACC 

GCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAG AG CTTG GCGGCG A ATGGG CTGA CCG CTT 

CCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCAT CG CCTT CTATCGCCTT CTTG ACG AGTT C 

TTCTGACAGATCCCCCGATGAGCTAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGCACT 

CAGTCTTTCATCTACGGCAATGTACCAGCTGATATAATCAGTTATTGAAATATTTCTGAATTTAAACTTG 

CATCAATAAATTTATGTTTTTGCTTGGACTATAATACCTGACTTGTTATTTTATCAATAAATATTTAAAC 

TATATTTCTTTCAAGATGGGAATTAACATCTACAAATTGCCTTTTCTTATCGACCATGTACGGGTACCGA 

GCTCGAATTCCTACGCAGCAGGTCTCATCAAGACGATCTACCCGAGTAACA 

the ATG initiation codon and the TGA stop codon are underlined. 
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Figure 8a 
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1 : Aatll 

2 : AccI 

3 : AflH 

4 : Alvn 

5 : Apal 

6 : Avrll 

7 : 3cL-H: 



8 : Bell 

9 : BspMTI 

10 : BSSHII 

11 : BstEII 

12 : BstXI 

13 : EagI 

14 : EcoNI 



15 : EcoRV 

16 : Eco31I 

17 : Espl 

18 : Hindi I 

19 : Hindlll 

20 : Kpnl 

21 : Nael 



22 : Narl 

23 : Ndel 

24 : Nhel 

25 : Nsil 

26 : PflMI 

27 : Ppcyj 

28 : Pvul 



29 : RsrII 

30 : Sad 

31 : Spel 

32 : Sphl 

33 : Tth111I 

34 : Xcal 

35 ; >2>oI 
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1 : Aatll 

2 : AccI 

3 : Aflll 

4 : AlvOT 

5 : Apal 

6 : Avrll 

7 : Ba.T*iI 



8 : Bell 

9 : Bse^II 

10 : BssHII 

11 t BstEII 

12 : BstXI 

13 : EagI 

14 : EcoNI 



15 : EcoRV 

16 : Bco31I 

17 : Espl 

18 : Hindll 

19 : HindTII 

20 : Kpnl 

21 : Nael 



22 : Narl 

23 : Ndel 

24 : Nhel 

25 : Nsil 

26 : PflMI 
21 : PpuXI 
28 : Pvul 



29 : RsrII 

30 : Sad 

31 : Spel 

32 : Spin 

33 : Tth111I 

34 : Xcal 

35 : Xhcl 
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